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Field of the Invention 

The invention is generally related to intelligent 
agent computer programs executable on computer systems and 
the like, and in particular, the use of such agents for 
performing specific tasks in a computer system. 

Background of the Invention 

Since the advent of the first electronic computers in 
the 1940' s, computers have continued to handle a greater 
variety of increasingly complex tasks. Advances in 
semiconductors and other hardware components have evolved 
to the point that current low-end desktop computers can 
now handle tasks that once required roomfuls of computers. 

Computer programs, which are essentially the sets of 
instructions that control the operation of a computer to 
perform tasks, have also grown increasingly complex and 
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powerful. While early computer programs were limited to 
performing only basic mathematical calculations, current 
computer programs handle complex tasks such as voice and 
image recognition, predictive analysis and forecasting, 
multimedia presentation, and other tasks that are too 
numerous to mention. 

However, one common characteristic of many computer 
programs is that the programs are typically limited to 
performing tasks in response to specific commands issued 
by an operator or user. A user therefore must often know 
the specific controls, commands, etc. required to perform 
specific tasks. As computer programs become more complex 
and feature rich, users are called upon to learn and 
understand more and more about the programs to take 
advantage of the improved functionality. 

In addition to being more powerful, computers have 
also become more interconnected through private networks 
such as local area networks and wide area networks, and 
through public networks such as the Internet. This 
enables computers and their users to interact and share 
information with one another on a global scale. However, 
the amount of information is increasing at an exponential 
rate, which makes it increasingly difficult for users to 
find specific information. 

As a result of the dramatic increases in the both 
complexity of computer programs and the amount of 
information available to users, substantial interest has 
developed in the area of intelligent agent computer 
programs, also referred to as intelligent agents or simply 
agents, that operate much like software-implemented 
"assistants" to automate and simplify certain tasks in a 
way that hides their complexity from the user. With 
agents, a user may be able to perform tasks without having 
to know specific sequences of commands. Similarly, a user 
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may be able to obtain information without having to know 
exactly how or where to search for the information. 

Intelligent agents are characterized by the concept 
of delegation, where a user, or client, entrusts the 
5 agents to handle tasks with at least a certain degree of 
autonomy. Intelligent agents operate with varying degrees 
of constraints depending upon the amount of autonomy that 
is delegated to them by the user. 

Intelligent agents may also have differing 

10 capabilities in terms of intelligence, mobility, agency, 
and user interface. Intelligence is generally the amount 
of reasoning and decision making that an agent possesses. 
This intelligence can be as simple as following a 
predefined set of rules, or as complex as learning and 

15 adapting based upon a user's objectives and the agent's 

available resources. An intelligent agent's intelligence 
or skill as applied to a specific field or function is 
referred to as domain knowledge. 

Mobility is the ability to be passed through a 

20 network and execute on different computer systems. That 
is, some agents may be designed to stay on one computer 
system and may never be passed to different machines, 
while other agents may be mobile in the sense that they 
are designed to be passed from computer to computer while 

25 performing tasks at different stops along the way. User 
interface defines how an agent interacts with a user, if 
at all. 

Agents have a number of uses in a wide variety of 
applications, including systems and network management, 
30 mobile access and management, information access and 
management, collaboration, messaging, workflow and 
administrative management, and adaptive user interfaces. 
Another important use for agents is in electronic 
commerce, where an agent may be configured to seek out 
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other parties such as other users, computer systems and 
agents, conduct negotiations on behalf of their client, 
and enter into commercial transactions. 

The relative performance of intelligent agents in 
5 handling certain tasks may vary between agents, e.g., 

depending upon the degree of domain knowledge possessed 
thereby. However, an agent which performs well when 
handling a particular task in one instance may not perform 
as well when handling other tasks, or even when handling 

10 the same type of task in other situations and under other 
circumstances. As a typical agent may experience a wide 
variety of situations when it is operating in a real world 
environment, a given agent will not always be able to 
provide its client with optimal performance. Moreover, it 

15 is often difficult, if not impossible, to develop an agent 
that is capable of handling all possible situations with 
equally proficient results. 

For example, in electronic commerce applications, 
security is a major concern. Intelligent agents may be 

20 entrusted with the authority to sell or purchase products 
on behalf of a client, and may be dispatched to any number 
of computer systems in a network to perform negotiations. 
Often, the computer systems to which the agents are 
dispatched are remote to the client's system, and thus not 

25 under the direct control of the client. An agent may thus 
be subject to scrutiny by third parties, either through 
monitoring transmissions between the agent and its client, 
or through actual scanning and reverse compiling of the 
program code of the agent. In some instances, the risk 

30 niay be segregated into transmission risk, which is 

associated with the transmission media between the agent 
and the client, and environment risk, which is associated 
with the environment in which the agent is resident. Each 
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type of risk may present unique concerns for a given 
agent . 

Beyond the clear risk of discovering confidential 
information such as a client's credit card or bank account 
5 number, additional risks are posed when an agent is 

subject to scrutiny. As an example, an agent would be 
placed at a competitive disadvantage if any of its 
negotiation strategies, techniques or plans, or any 
specific instructions from its client, were discovered by 

10 other parties . If a message to an agent from its client 
indicates that the agent should offer $100, but is 
authorized to go as low as $90, another party that 
intercepts this message immediately knows that a 
transaction may be completed at the lower price. Also, 

15 even if the message is not intercepted, if the agent has 
stored the $90 price as its lowest authorized offer, 
reverse compilation or scanning of the agent by another 
party may lead to similar results . 

The degree of domain knowledge such as autonomy 

20 granted to an agent by its client, and the remaining 
domain knowledge retained by the client, may vary 
significantly for different agents. In general, however, 
the more domain knowledge that is imparted to an agent, 
the greater the security risk presented to the agent and 

25 its client. 

Often, a tradeoff must be made between absolutely 
secure agent transactions with prohibitively high costs, 
and less secure agent transactions with acceptable costs. 
The greatest security would be obtained when most, if not 

30 all, of the domain knowledge for conducting negotiations 
is retained by the client, with only a limited 
intermediary agent sent out to relay messages between the 
client and the other party being negotiated with. 
However, this configuration would present significant time 
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and cost burdens on the client since all decision making 
would remain with the client, and since multiple messages 
would likely need to be transmitted between the client and 
the other party. Given these likely burdens, there would 
5 be little, if any, benefit in utilizing an agent at all. 
This would especially be the case in operating 
environments where the security risks are known to be 
small . 

Security risks are also present for other 
10 applications of intelligent agents. For example, a client 
may have developed an agent with a unique search engine 
that greatly outperforms its competitors' designs. In 
certain operating environments, the agent may be more 
likely to be intercepted and copied by competitors — a 
15 situation that the client would prefer to avoid. 

However, without an agent mechanism that appropriately 
balances cost and security, the industry will continue to 
be limited by today's solutions. 
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Summary of the Invention 

The invention addresses these and other problems 
associated with the prior art in providing a manner of 
optimizing the performance of a given computer task with 
5 an intelligent agent by utilizing one or more of a 
plurality of program modules suited to perform the 
computer task but having varied degrees of domain 
knowledge. Based upon an objective criteria that is 
determined for a given situation, an intelligent agent is 

10 configured to execute one or more of the program modules 

to perform the task. Consequently, the performance of the 
computer task by the intelligent agent is optimized for a 
wide variety of situations. 

In an exemplary embodiment of the invention, program 

15 modules having varied levels of autonomy are used to 

conduct negotiations, with the projected security risk for 
a given negotiation being used as the objective criteria 
by which program modules are selected to conduct the 
negotiations. In unknown or untested environments, or in 

20 environments in which known security risks are present, a 
program module which merely acts as an intermediary 
between the client and another party is selected, leaving 
the majority of domain knowledge such as authority to 
complete transactions with the client. For other 

25 environments where the security risk is known to be not as 
great, program modules having more autonomy to conduct 
negotiations and complete transactions independent of the 
client are selected, thereby relieving the client of much 
of the burden of handling these particular negotiations. 

30 However, it should be appreciated that a vast number of 
objective criteria for selecting different agents may be 
used in the alternative, and thus the invention is not 
limited to this particular embodiment or application. 
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These and other advantages and features, which 
characterize the invention, are set forth in the. claims 
annexed hereto and forming a further part hereof. 
However, for a better understanding of the invention, and 
5 of the advantages and objectives attained through its use, 
reference should be made to the Drawing, and to the 
accompanying descriptive matter, in which there is 
described illustrated embodiments of the invention. 
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Brief Description of the Drawing 

FIGURE 1 is a flowchart illustrating the operation of 
an agent manager in selecting one of a plurality of 
intelligent agents with varied degrees of domain knowledge 
5 to perform a computer task, consistent with the invention. 

FIGURE 2 is a flowchart illustrating the operation of 
an agent manager in selecting within an intelligent agent 
one or more of a plurality of program modules with varied 
degrees of domain knowledge to perform a computer task, 
10 consistent with the invention* 

FIGURE 3 is a flowchart illustrating the operation of 
an intelligent agent in selecting one or more of a 
plurality of program modules therein having varied degrees 
of domain knowledge to perform a computer task, consistent 
15 with the invention. 

FIGURE 4 is a block diagram of a networked computer 
system environment for use with the illustrated 
embodiments of the invention. 

FIGURE 5 is a block diagram of the networked computer 
20 system of Fig. 4, illustrating the interaction between 
intelligent agents therein. 

FIGURE 6 is another block diagram of the networked 
computer system of Fig. 4, illustrating in greater detail 
the primary components in the client and remote computer 
25 systems. 

FIGURE 7 is a block diagram of an exemplary agent 
manager and reinforcement learning module consistent with 
the invention, and for use in the selection of one of a 
plurality of agents having varied degrees of domain 
30 knowledge . 

FIGURE 8 is a flowchart illustrating the operation of 
a semi-autonomous agent consistent with invention. 

FIGURE 9 is a flowchart illustrating the operation of 
a fully-dependent agent consistent with invention. 
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FIGURE 10 is a flowchart illustrating the operation 
of a fully- autonomous agent consistent with invention. 

FIGURE 11 is a block diagram of an exemplary 
intelligent agent having multiple program modules of 
varied degrees of domain knowledge. 

FIGURE 12 is a block diagram of another exemplary 
intelligent agent constructed from program modules from 
one or more pools of program modules. 
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Detailed Description of the Illustrated Embodiments 

The embodiments of the invention are principally 
directed to optimizing the performance of computer tasks 
by intelligent agents in a wide variety of situations, 
5 Intelligent agents are computer programs which have 

been delegated a degree of autonomy but which are limited 
to operating within constraints defined by their client 
(which may be, for example, another agent; a computer 
program, application or system; or an individual 

10 interacting with the agent through a computer — 

hereinafter a client computer process) . A subset of such 
agents which are capable of being passed between and 
operating in different applications or computer systems 
are referred to as mobile agents* In certain 

15 circumstances, the functions of a mobile agent, combined 
with the functions of an agent manager program that is 
resident in a client's computer system and interacts with 
the mobile agent, may cooperatively be considered as 
portions of a single intelligent agent where the domain 

20 knowledge has been allocated between home and mobile 

portions. However, as used herein, the term agent may be 
considered to include simply a portion or module of an 
agent resident in a computer system irrespective of 
whether any portion of the functionality for the agent as 

25 a whole is resident on another computer system or in 
another computer process . 

In general, an agent has the ability to sense, 
recognize and act. These functions are typically embodied 
in a number of components, including an engine, a 

30 knowledge component, an adapters component, a library, and 
a view component. Generally, the engine controls the 
overall operation of an agent and functions as the 
"brains" of the agent, and the knowledge component stores 
information that is representative of the acquired 
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knowledge of the agent. The adapters component handles 
interaction between an agent and external objects, thereby 
functioning as the mechanism through which the agent 
"senses" and interacts with its environment. A library 
5 persistently stores the information utilized by the 

knowledge component, and the view component provides the 
human interface, if any, for an agent, e.g., for supplying 
instructions to the agent. 

Moreover, as discussed above, agents may have varied 

10 degrees of domain knowledge -- that is, intelligence 

and/or skills applied to a specific field or function, 
typically applied to a particular computer task. This may 
be exhibited through greater or lesser degrees of autonomy 
or authority, alternate processing strategies, different 

15 types and/or amounts of stored information, different 
sources of information, different capabilities (e.g., 
monitoring functionality vs. monitoring and response 
functionality), different goals or objective functions, 
different inputs or sensed values to which an agent may be 

20 responsive, alternate logic implementations (e.g., neural 
networks vs. procedural logic), etc. 

For example, 5 in the illustrated embodiments discussed 
in greater detail below, domain knowledge is represented 
in part by the degree of autonomy granted to an agent to 

25 conduct negotiations in electronic commerce applications 

as well as the inputs to which the agent is responsive and 
the knowledge present in the agent. In addition, in other 
electronic commerce applications, domain knowledge may 
represent different manners or strategies for achieving a 

30 desired goal, e.g., optimizing a negotiation strategy for 
one agent to maximize profit, while optimizing a strategy 
for another agent to maximize the number of completed 
transactions. Domain knowledge in such applications may 
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also be market dependent , e.g., where agents are optimized 
for automobile markets, stock markets, etc. 

As another example, domain knowledge may represent a 
particular indexing or searching algorithm used by a data 
5 mining agent, e.g., where one particular searching 

algorithm may work better for legal cases while another 
may work better for technical documents. In other data 
mining applications, domain knowledge for one type of 
agent may include only the ability to monitor for 

10 problems, e.g., looking for low stock or late deliveries 
in an inventory monitoring agent. In other agents, the 
domain knowledge may also include the ability to respond, 
e.g., in the above inventory application, ordering low 
stock items, ordering alternates for late deliveries, or 

15 canceling orders when sales are low. 

Domain knowledge may also simply represent different 
manners of performing the same or similar general task, 
e.g., for the purpose of disguising the identity of the 
agents' client, since differently-configured agents may 

20 present different characteristics when scanned by third 
parties. For example, this may permit differently- 
configured agents to be dispatched from a single client to 
conduct negotiations with a single party so that the 
agents play off one another to obtain more favorable 

25 results for the client. 

Intelligent agents are typically programmed to handle 
particular computer tasks, which may include practically 
any operation performed on a computer. For example, in 
electronic commerce applications, the computer tasks for 

30 which agents are programmed to handle are typically that 
of conducting negotiations and completing transactions. 
For data mining applications, the computer tasks may be 
the indexing or retrieval of desired information from one 
or more databases . 
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Consistent with the invention, one or more objective 
criteria are used in the selection of one of a plurality 
of agents, or alternatively, one of a plurality of program 
modules in an agent, such that a given computer task is 
5 handled with optimal performance for its given situation. 
The plurality of agents (or program modules) are all 
configured to handle a common general computer task (e.g., 
"buy a car", "get me everything you can find out about 
company X", etc.), although the relative performance, 

10 capabilities, manners, and mechanisms by which the 

different agents (or program modules) operate to handle 
the task may nonetheless differ. 

A program module (or mechanism) may be considered to 
include any function, object, or block of program code 

15 configured to perform a given computer task. A suitable 
objective criteria may represent practically any testable 
or measurable variable or variables that varies in 
different situations to which an agent may be exposed, and 
that may affect an agent's relative performance under 

20 those situations. 

For example, in the illustrated embodiments discussed 
in greater detail below, the objective criteria is the 
perceived risk to an agent dispatched to a remote computer 
system, whether due to known risks or due to a lack of 

25 knowledge as to the risk inherent in an untested 

environment. The objective criteria may also include 
relative compute or transmission speeds, or the perceived 
risk of a particular transmission media, which may be used 
to determine whether domain knowledge related to 

30 disguising or encrypting messages is implemented in a 
dispatched agent. 

Other examples of objective criteria may include the 
source of information for data mining applications (e.g., 
particular domains or databases, or types of data) , user 
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approval or feedback (e.g., a qualitative assessment of 
performance by a user) , and profits or costs for 
electronic commerce applications, among others. 

Selection of agents or program modules within agents 
5 based upon the objective criteria will vary depending upon 
the particular objective criteria and types of situations 
to which the agent (s) may be exposed. For example, for 
the illustrated embodiments discussed in greater detail 
below, the risk to an agent may be tested based upon the 

10 IP address of the remote system to which an agent is to be 
dispatched, and any identifying information about the 
other party to a negotiation, among others. Separate risk 
factors may also be monitored for the transmission media 
and the agent's resident environment such that domain 

15 knowledge specific to each type of risk may be separately 
utilized as necessary. 

For data mining applications, the source of 
information could be tested based upon the IP address of 
the database being searched, the type of information being 

20 sought, etc. In electronic commerce and other 

applications, the reliability of data could also be tested 
by looking at the source thereof (e.g., prices retrieved 
from a bulletin board may be more reliable than prices 
quoted in a magazine article) , and/or by comparing the 

25 similarity of the data to the type of information being 
sought. Also, the volatility of a market or the type, 
quantity or value of particular classes of goods may be 
tested (e.g., a car buying agent may warrant or require a 
more aggressive negotiation strategy than a compact disc 

3 0 buying agent) . 

In addition, selection based upon an objective 
criteria may occur dynamically, and may even be updated 
from time to time. For example, an agent may be 
configured to dynamically select program modules in 
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response to changing circumstances such as changes in 
market conditions, changes in size, speed, or other 
capacity of a remote site, and new input from a client 
(e.g., as related to information that is otherwise 
5 unavailable to an agent while resident in a remote 
system), among others. 

Agents or program modules may be matched with 
particular values of an objective criteria to assist in 
the selection of agents or program modules. The matching 

10 may be as simple as a mapping or table matching 

agents/program modules and values of the objective 
criteria, or may include predictive and interpretive logic 
such as provided by factor weighting, neural networks, 
expert systems, fuzzy logic systems, etc. to attempt to 

15 select the most appropriate agent/program module even when 
no direct match exists for a given value of the objective 
criteria. 

Moreover, selection based upon the objective criteria 
may be adaptively controlled. For example, the actual 

20 results and relative performance of agents dispatched to 
perform particular tasks may be analyzed and used to 
modify the selection of agents for given values of the 
objective criteria. To this extent, a reinforcement 
learning module, implemented for example in a neural 

25 network, may be particularly well suited for use in agent 
or program module selection consistent with the invention. 

As discussed above, the objective criteria may be 
used to select between a plurality of agents, or between a 
plurality of program modules in an agent, that have varied 

30 degrees of domain knowledge. The program flow for 

dispatching one of a plurality of agents based upon an 
objective criteria is illustrated by agent manager 200 in 
Fig. 1. In block 201, a request to perform a task is 
received, and in block 202 the objective criteria is 
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tested by the agent manager. Next, in block 204, an agent 
is selected, based upon the objective criteria, from a 
pool of agents having varied degrees of domain knowledge. 
In block 206, the selected agent is dispatched to perform 
5 its task, and in block 208, the performance of the agent 
upon completion of the task is optionally analyzed if 
adaptive agent selection is utilized. 

Similarly, the program flow for selecting one or more 
of a plurality of program modules based upon an objective 
10 criteria is illustrated by agent manager 210 in Fig. 2. 

In block 211, a request to perform a task is received, and 
in block 212 the objective criteria is tested by the agent 
manager . 

Next, in block 214, one or more program modules are 

15 selected, based upon the objective criteria, from a 

plurality of program modules having varied degrees of 
domain knowledge. In some embodiments, one of a plurality 
of available program modules may be selected. For 
example, program modules may have duplicate functionality 

20 that is implemented in alternate manners. In such 
instances, only one module may be selected. 

In other embodiments, individual program modules may 
have additive or non-overlapping functionality so that 
multiple program modules (e.g., a subset of program 

25 modules) are selected in a given agent. For example, one 
program module may provide basic domain knowledge, with 
additional program modules selected as necessary to add 
additional domain knowledge to more sophisticated agents. 
In addition, the agent may need, to be configured for 

30 proper execution of the appropriate program module or 

modules. For example, if an agent is provided with both 
selected and non-selected program modules (e.g., where 
duplicative functionality is provided by different 
alternative modules) , an agent may be configured by 
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initializing, sending a message or setting a variable in 
the agent specifying which program module or modules to 
execute. In the alternative, an agent may be constructed 
from component parts prior to its dispatch, whereby the 
5 agent may be configured by bundling all of the selected 
program modules with any necessary components that are 
generic to all of an agents' permutations. 

Next, in block 216, the configured agent is 
dispatched to perform its task, and in block 218, the 

10 performance of the agent upon completion of the task is 
optionally analyzed if adaptive selection is utilized. 

It should also be appreciated that the functions of 
testing the objective criteria, selecting agents or agent 
program modules based upon the objective criteria, and/or 

15 analyzing the performance of agents need not be performed 
by an agent manager or other computer process resident in 
the client computer system, but may be performed by any 
suitable evaluation module or program. In addition, 
performance analyzation may also rely on a user' s 

20 subjective assessment. A separate agent may be utilized 
as an evaluation module to perform any or all of these 
functions. Also, any of these functions may be 
implemented in the agent that has been dispatched to 
perform a task, whereby the agent includes an evaluation 

25 module. 

For example, as illustrated in Fig. 3, an agent 220 
may receive a task request in block 221, test the 
objective criteria in block 222, select the appropriate 
program module in block 224 and execute the selected 
30 program module in block 226 to perform the task. In 

addition, the agent may analyze its performance in block 
228 if desired. The above-described configuration has a 
number of benefits. For example, in electronic commerce 
applications, an agent could be dispatched with a task to 
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seek out negotiations on multiple remote systems, whereby 
the agent would have the ability to dynamically determine 
the risk at each remote system and alter its behavior 
accordingly. 

5 

Exemplary Hardware/Software Environment 
A representative hardware environment suitable for 
use with the illustrated embodiments of the invention is 
illustrated in Fig. 4, where a networked computer system 

10 10 generally includes one or more computer systems, e.g., 
single-user computer systems 16, 18 and multi-user 
computer systems 20, 60, coupled through a network 15. 
Multi-user computer system 20 typically includes one or 
more servers 25 to which one or more single-user computers 

15 22 may networked through a separate network 24. 

Similarly, multi-user computer system 20 typically 
includes one or more servers 65 coupled to one or more 
single-user computer systems 62 through a network 64. 
Network 15 may represent any type of networked 

20 interconnection, including but not limited to local-area, 
wide-area, wireless, and public networks (e.g., the 
Internet) . 

It is anticipated that agents consistent with the 
invention may originate in and be resident from time to 

25 time on any of the above-mentioned computer systems. One 
possible distinction between the computer systems for the 
purposes of the invention may be whether each is a client 
or a remote system relative to a particular agent. For 
example, Fig. 5 illustrates an embodiment of computer 

30 system 10 where multi-user computer system 20 is a client 
system, and multi-user computer system 60 is a remote 
system. 

A client system will hereinafter refer to a computer 
system that provides an agent a certain level of security 
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from manipulation by other parties when the agent is 
resident on the system. The client system is also the 
computer system from which management of the agent is 
typically handled. The agent typically but not 
5 necessarily will also originate from the client system. 

A remote system, on the other hand, will hereinafter 
refer to a computer system that is typically not capable 
of providing a desired level of security for an agent, 
generally because the computer system is not under the 

10 control of the client. It is typically while resident on 
a remote system that an agent runs the greatest risk of 
being scanned or reverse compiled, or of having 
communications intercepted or monitored, by other parties. 
The various embodiments described hereinafter have 

15 principal uses in electronic commerce applications, where 
agents are configured to negotiate commercial 
transactions, generally in the role of buying or selling 
agents. The agents may negotiate with other agents, other 
computer systems, or even other individuals. The agents 

20 may interact one-on-one, or may be capable of operating 

within a "market" of multiple agents, along the lines of a 
stock or commodity market. Computer systems having the 
ability to host agents for interaction therebetween 
include negotiating programs of varying sophistication and 

25 are hereinafter referred to as agent hosts. However, it 
should be appreciated that the invention applies equally 
to other applications of intelligent agents, and thus 
should not be limited specifically to electronic commerce 
applications . 

30 Fig. 5 illustrates a mobile intelligent agent 100 

which communicates with an agent manager 32 in client 
system 20. During negotiation with another party such as 
negotiating agent 95, mobile agent 100 is resident on 
remote system 60. It should be appreciated that remote 
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system 60 may be the client for agent 95, or may also be 
considered to be remote relative to this agent as well* 

An exemplary functional design of networked computer 
system 10 for implementing various embodiments of the 
5 invention is illustrated in Fig. 6. Server 25 of client 
system 20 generally includes a central processing unit 
(CPU) 28 coupled to a memory 30 and storage 40 over a bus 
54. A local area network interface is provided at 52, and 
an interface to remote system 60 over external network 15 

10 is provided through interface 50. Agent manager program 
32 is resident in memory 30, as is a reinforcement 
learning module 34 that enables the agent manager to 
adaptively optimize the performance of tasks. Storage 40 
includes one or more agents 42 (of which may include agent 

15 100, for example) , which are computer programs or modules 
that may be retrieved and used locally within system 20, 
or dispatched to remote systems to execute and perform 
tasks on behalf of the client system. Storage 40 also 
includes an agent mission database 44 which may track 

20 agent operations and the relative success or failure 
thereof . 

Server 65 of remote system 60 also includes a CPU 68 
coupled to a memory 70, storage 80, external network 
connection 90 and local network connection 92 over a bus 

25 94. An agent host program 72 is resident in memory 70 to 
handle interactions between agents resident in the remote 
system. Typically, the agent host program is an 
asynchronous message/event driven environment that 
provides a common platform over which agent computer 

30 programs execute and interact, much like an operating 
system. The agent host is also capable of permitting 
messages to be sent between agents and their clients. 
Memory 70 also includes a negotiating program 74 which 
operates as the "other party" in transactions with agent 
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100, which may be another agent, a market or bulletin 
board application, or even an interface program through 
which an individual interacts with agent 100. Storage 80 
maintains a transaction history database 82 which logs the 
5 transactions completed on the server* 

Servers 25, 65 may be, for example, AS/400 midrange 
computers from International Business Machines 
Corporation. However, it should be appreciated that the 
hardware embodiments described herein are merely 
10 exemplary, and that a multitude of other hardware 
platforms and configurations may be used in the 
alternative . 

Moreover, while the invention has and hereinafter 
will be described in the context of fully functioning 

15 computer systems, those skilled in the art will appreciate 
that the various embodiments of the invention are capable 
of being distributed as a program product in a variety of 
forms, and that the invention applies equally regardless 
of the particular type of signal bearing media used to 

20 actually carry out the distribution. Examples of signal 
bearing media include but are not limited to recordable 
type media such as floppy disks, hard disk drives, and CD- 
ROM's, and transmission type media such as digital and 
analog communications links. 

25 

Negotiating Agents with Adaptive Agent Selection 
Fig. 7 illustrates an exemplary configuration of 
client computer system 2 0 for providing adaptive agent 
selection consistent with the invention. Agent manager 32 
30 is coupled to a reinforcement learning module 34 through 
which analysis of agent performance and control of agent 
selection is performed. Agent manager 32 is shown coupled 
to a pool of agents 42 having varied degrees of domain 
knowledge, or alternatively, to one or more pools of 
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program modules 42 ' having varied degrees of domain 
knowledge. A dispatched agent 100 communicates with agent 
manager 32 to conduct negotiations on behalf of the 
client . 

5 One or more information sources 56 are also coupled 

to agent manager 32 to enable it to obtain the factor or 
factors for testing the objective criteria (in this 
embodiment, security or risk) . Information sources 56 may 
include internal databases updated by manager 32 or other 
10 components in client system 20, as well as external 
databases . 

For example, risk may be determined by maintaining a 
database of one or more risk factors for particular 
destinations, addresses or parties being negotiated with, 

15 To this extent, information about a remote system or other 
agents may be obtained by the agent manager, such as a 
name or identification, an operating system, a type of 
communication port, a client, a bank and/or bank account 
number, a homebase address, the name or identification of 

2 0 an agent program, the size of an agent program, where 

messages and other communications with an agent originate, 
and/or the pattern of input/output (I/O) compared to CPU 
cycles for I/O transmissions. Also, the agent manager may 
attempt to retrieve a credit card number or bank account 

2 5 number from an unknown agent and validate the number. 

Moreover, an unknown agent may be scanned and compared to 
other known agents, e.g., comparing the percentage of 
identical code, determining the language the agent was 
written in, or searching for unique patterns in much the 

30 same manner as a virus checking program. 

Risk may also be analyzed over time based upon the 
actual performance of dispatched agents, e.g., by 
monitoring the number of dispatched agents that disappear, 
or the length of time between communications from 



Page 23 

IBM R0996-0052 

MSG 10790. 100US01 
Patent Application 



dispatched agents. Also, risk may be analyzed by 
monitoring dispatched agents for tampering, e.g., by 
performing checksums or cyclic redundancy checks, 
analyzing binary signatures, etc. Either the agent 
manager or the agents themselves may be used to detect any 
tampering. Risk may also be analyzed by monitoring the 
behavior of other parties being negotiated with. 

In addition, risk may be measured based upon the 
profitability of agents. For example, the performance of 
a negotiating agent may be measured in terms of the number 
of complete transactions, gross income of all 
transactions, net profit of all transactions, etc. The 
performance may then be offset by the costs of using the 
agent, e.g., in terms of the time required to complete 
transactions, local/remote CPU charges, communication (on- 
line) charges, etc. If an agent is compromised in some 
way, where another party gains a negotiating advantage, a 
negative impact on profitability would be observed. 

Agent manager 32 provides inputs to reinforcement 
learning module 34 to permit module 34 to perform the 
functions of estimating risk over time and selecting 
agents or agent program modules. Module 34 maintains a 
mapping between agents/program modules and different 
levels or types of risk, so that an appropriate 
agent/program module is selected depending upon the 
measured level or type of risk to a dispatched agent under 
the current circumstances. 

Agent manager 32 provides optimum agent performance 
data related to a desired task for which to test the 
objective criteria. In response, module 34 outputs a 
control action output back to agent manager 32 to control 
which agent or agent program module should be selected and 
dispatched for performing the given task. Also, when a 
dispatched agent is returned and analyzed, agent manager 
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32 provides actual performance data to adaptively modify 
the selection functionality of module 34. 

In the illustrated embodiment, an adaptive heuristic 
critic neural network architecture is utilized in module 
5 34 to provide adaptive reinforcement learning. The 

design, training and operation of an adaptive heuristic 
critic neural network is well known in the art. For 
example, one suitable development environment for 
performing the development and training of such a neural 

10 network is the IBM Neural Network Utility available from 

International Business Machines Corporation, among others. 

Module 34 includes a sensor 36 which compares desired 
(optimum) agent performance and actual agent performance 
and generates a scalar reinforcement reward/penalty value 

15 therefrom. A critic network 37 provides temporal 

difference learning in response to the actual agent 
performance data and the reinforcement value from the 
sensor. Critic network 37 outputs a utility signal that 
is back propagated to a plurality of action networks 38. 

20 An action network 38 is provided for each agent or 

program module. Each action network receives agent 
performance data and outputs therefrom an action merit 
signal to a stochastic action behavior block 39. Block 39 
probabilistically chooses an action (here, which one of 

25 the agents or program modules to dispatch) , and outputs 
therefrom a control action signal representative thereof 
to agent manager 32. Block 39 introduces randomness to 
agent selection to widen its search space, which has been 
found to improve long term performance of the module. 

30 Module 34 adaptively modifies its selection of agents 

based upon the analyzed performance of a dispatched agent. 
If an agent performs well, sensor 36 reinforces critic 
network 37, which in turn reinforces the corresponding 
action network 38 such that it is more likely that the 



Page 25 

IBM R0996-0052 

MSG 10790. 100US01 
Patent Application 



same agent/program module is selected under similar 
circumstances. On the other hand, if an agent performs 
poorly, sensor 36 negatively reinforces critic network 37, 
which in turn reduces its utility output to action 
networks 38 to reduce the likelihood that the same 
agent/program modules is selected under similar 
circumstances . 

In the alternative, other reinforcement learning 
modules and algorithms may be used. For example, other 
neural network architectures, such as Q-Learning neural 
networks, Bayesian networks, dynamic programming 
techniques, among others, may be used. 

Module 34 may be initially trained in a manner known 
in the art, e.g., by using example training records. In 
the alternative, module 34 may simply be initialized with 
predetermined or random networks which act as a starting 
reference point from which the module adaptively learns. 
The former approach often improves the quality of the 
initial decisions made by the module in operation. 

Agent pool 42 includes a plurality of agents in which 
the domain knowledge thereof varies in the degree of 
autonomy delegated thereto during negotiations, the 
internal knowledge therein, and the inputs to which an 
agent is responsive thereto. At one extreme, a simple 
intermediary agent, or fully dependent agent, may have 
little domain knowledge in that the agent merely serves 
the function of passing messages between a client and 
another party to a negotiation, such that authority for 
determining offers, accepting or rejecting offers, 
implementing negotiation strategies, etc. rests fully with 
the client. At the other extreme, a fully autonomous 
agent may be provided with the basic parameters for a 
desired transaction and thereafter entrusted with the 
authority to complete such a transaction free from any 
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additional input on the part of the client. Between these 
extremes, one or more additional agents, referred to as 
semi-autonomous agents, may be entrusted with the ability 
to conduct limited negotiations. One possible semi- 
5 autonomous agent may be authorized to conduct negotiations 
for a limited number of offer iterations, but with final 
authority for approving an agreed-upon transaction still 
resting with the client. Other possible degrees of domain 
knowledge between the extremes are also possible. 

10 Fig. 8 illustrates an exemplary program flow for 

negotiation with a semi-autonomous agent which has 
authority to negotiate with another party with limits on 
the offer amount and the number of iterations (offers made 
by the agent) that the agent can undertake on its own. 

15 The interaction between the agent and an agent manager on 
a client system, and the other party being negotiated with 
(another agent, computer process or system, or individual 
interacting through a computer) , is also illustrated. 
For the semi-autonomous agent, negotiations are 

20 instituted after the agent manager selects the agent from 
the agent pool and dispatches the agent, as illustrated in 
blocks 130 and 131. It should be appreciated that the 
agent may be dispatched with a desired transaction or 
transactions which the client wishes to complete 

25 (including information such as desired quantity, 

description of goods/services to be sold/purchased, etc.), 
as well as any other constraints placed upon the agent by 
the manager (e.g., the number of negotiation iterations to 
process, a maximum or minimum limit on the amount to 

30 offer, etc.). In the alternative, some or all of this 

information may be relayed to the agent at a later time by 
the agent manager. 

Next, in block 132, the agent gathers information 
after it has been dispatched to the remote system. For 
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example, the agent may poll a bulletin board or market for 
outstanding offers, or may attempt to find and initiate 
negotiations with another party, e.g., another computer 
system, another agent, or an individual interacting 
5 through a computer system. The agent may also simply wait 
for incoming offers from other parties. Block 132 also 
performs the functions of processing messages from other 
parties and from the agent manager. 

Next, in response to input received by the agent, a 
10 decision is made in block 134. It is in this block that a 
negotiation strategy of the agent is implemented, whereby 
the agent determines, based upon the input it receives, 
whether and how to proceed in negotiations with another 
party. 

15 Generally, one of three actions may be instituted in 

response to block 134. First, as illustrated by block 
136, a message may be sent to the other party, e.g., 
indicating acceptance or rejection of an outstanding 
offer, issuing an offer or counteroffer, terminating 

20 negotiations, requesting that the other party wait, etc. 
The other party receives the message at block 138 and 
processes it at block 140, typically issuing a response 
message back to the agent. Control then returns to block 
132 to process the response message from the other party. 

25 Second, as illustrated by block 142, the agent may 

decide to send a message back to the agent manager. The 
agent manager then receives the message at block 144, 
processes the message in block 146 and sends a response 
message back to the agent in block 148. In block 150, the 

30 agent receives the message and passes control to block 132 
to process the response message from the agent manager. 
Also, as disclosed in U.S. Patent Application Serial No. 

, entitled "APPARATUS AND METHOD FOR COMMUNICATING 

BETWEEN AN INTELLIGENT AGENT AND CLIENT COMPUTER PROCESS 
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USING DISGUISED MESSAGES" (which has been incorporated by 
reference) , the messages between the agent and its manager 
may be disguised to impede scrutiny by third parties. 
Third, as illustrated by block 152, the agent may 
5 decide to leave the remote system and return to its client 
system, e.g., after completion of a successful transaction 
or after termination of an unsuccessful negotiation. An 
agent may x> leave" when its program code is moved or 
transmitted from the remote system to the client system. 

10 In the alternative, an agent may transmit any necessary 

information (e.g., domain knowledge and/or results) to the 
client system and terminate on the remote system. 

The return of the agent is illustrated in block 154. 
Then, in block 156, the performance of the agent may be 

15 analyzed in the manner discussed above. 

Fig. 9 illustrates an exemplary program flow for 
negotiation with a fully-dependent agent which acts 
principally as an intermediary with no authority to 
negotiate with another party without specific instructions 

2 0 from the agent manager. As with the semi-autonomous 

agent, negotiations are instituted after the agent manager 
selects the agent from the agent pool and dispatches the 
agent, as illustrated in blocks 160 and 162. Next, in 
block 164, the agent gathers information after it has been 

25 dispatched to the remote system. Block 164 also performs 
the functions of processing messages from other parties. 

Next, in response to input received by the agent 
(typically as a result of an offer being received from 
another party) , a message is sent to the agent manager in 

30 block 166. The agent manager then receives the message at 
block 168, processes the message in block 170 and sends a 
response message back to the agent in block 172. As with 
the semi-autonomous agent, messages between the agent and 
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agent manager may be disguised to impede scrutiny by third 
parties . 

Next, in block 174, the agent receives the message 
and passes control to block 176 to process the instruction 
5 provided in the response message. Generally, one of two 
actions may be instituted in response to block 176. 
First, as illustrated by block 178, a message may be sent 
to the other party, e.g., indicating acceptance or 
rejection of an outstanding offer, issuing an offer or 

10 counteroffer, terminating negotiations, requesting that 
the other party wait, etc. The other party receives the 
message at block 180 and processes it at block 182, 
typically issuing a response message back to the agent. 
Control then returns to block 164 to process the response 

15 message from the other party. 

Second, as illustrated by block 184, the agent may 
decide to leave the remote system and return to its client 
system, e.g., after completion of a successful transaction 
or after termination of an unsuccessful negotiation. The 

20 return of the agent is illustrated in block 186. Then, in 
block 188, the performance of the agent may be analyzed in 
the manner discussed above. 

Fig. 10 illustrates an exemplary program flow for 
negotiation with a fully- autonomous agent which has full 

25 authority to conduct negotiations with little or no 
supervision by the agent manager. As with the semi- 
autonomous agent, negotiations are instituted after the 
agent manager selects the agent from the agent pool and 
dispatches the agent, as illustrated in blocks 190 and 

30 191. Next, in block 192, the agent gathers information 

after it has been dispatched to the remote system. Block 
192 also performs the functions of processing messages 
from other parties. 
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Next, in response to input received by the agent 
(typically as a result of an offer being received from 
another party), control passes to block 193 to make a 
decision based upon the inputs to the agent. It is in 
this block that a negotiation strategy of the agent is 
implemented, whereby the agent determines, based upon the 
input it receives, whether and how to proceed in 
negotiations with another party. 

Generally, one of two actions may be instituted in 
response to block 193. First, as illustrated by block 
194, a message may be sent to the other party, e.g., 
indicating acceptance or rejection of an outstanding 
offer, issuing an offer or counteroffer, terminating 
negotiations, requesting that the other party wait, etc. 
The other party receives the message at block 195 and 
processes it at block 196, typically issuing a response 
message back to the agent. Control then returns to block 
192 to process the response message from the other party. 

Second, as illustrated by block 197, the agent may 
decide to leave the remote system and return to its client 
system, e.g., after completion of a successful transaction 
or after termination of an unsuccessful negotiation. The 
return of the agent is illustrated in block 198. Then, in 
block 199, the performance of the agent may be analyzed in 
the manner discussed above. 

Any number of negotiation strategies and features may 
be incorporated into any of the agents in agent pool 42. 

For example, U.S. Patent Application Serial No. , 

entitled ^ INTELLIGENT AGENT WITH NEGOTIATION CAPABILITY 
AND METHOD OF NEGOTIATION THEREWITH", which has been 
incorporated by reference, discloses a suitable 
negotiation strategy which relies on the previous and 
current offers from the agent as well as the previous and 
current prices offered by the other party. A number of 
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additional negotiation features implemented in the 
aforementioned application may also be used, including 
dynamic value determination, disguising negotiation 
strategy by randomizing an agent's behavior, and limiting 
unproductive negotiations by constraining an agent's 
behavior, among others. Other negotiation strategies and 
features may also be used consistent with the invention. 

Moreover, any of the agents in agent pool 42 may be 
implemented using neural networks for implementing 
decision making and/or message disguising, as disclosed in 

U.S. Patent Application Serial No. , entitled 

XA APPARATUS AND METHOD FOR COMMUNICATING BETWEEN AN 
INTELLIGENT AGENT AND CLIENT COMPUTER PROCESS USING 
DISGUISED MESSAGES'' (which has been incorporated by 
reference) . For example, for a semi-autonomous agent, a 
neural network may be configured to receive the following 
inputs : 

• other party's last price 

• other party' s current price 

• agent's last public offer 

• product characteristics 

• instruction from manager: 

• increase in offer authorized 

• increase in iterations authorized 

• increase in offer not authorized 

• increase in iterations not authorized 

• last message sent back to manager 

In addition, the neural network may be configured to 
generate the following outputs: 

• public price offer 

• message to other party 

• make offer 
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• withdraw offer 

• accept 

• reject 

• counteroffer 
5 • please wait 

• finished 

• message to manager 

• probable acceptable alternative, do you accept 

• iterations exceeded, recommend continue 
10 • iterations exceeded, recommend withdrawal 

• recommend immediate withdrawal 

• dummy transmission to motivate negotiation 

• request approval to increase offer 

For a fully-dependent agent, a neural network may be 
15 configured to receive the following inputs: 

• other party' s last price 

• other party' s current price 

• number of iterations 

• agent's last public offer 
20 • product characteristics 

• instruction from manager: 

• buy or sell 

• withdraw 

• counteroffer 

25 In addition, the neural network may be configured to 

generate the following outputs: 

• public price offer 

• message to other party 

• make offer 

30 • withdraw offer 
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• accept 

• reject 

• counteroffer 

• finished 

5 • message to manager 

• current price offer 

• other party' s current price 

• feature presence 

For a fully- autonomous agent, a neural network may be 
10 configured to receive the following inputs: 

• other party's last price 

• other party' s current price 

• number of iterations 

• agent's last public offer 
15 • product characteristics 

In addition, the neural network may be configured to 
generate the following outputs: 

• public price offer 

• message to other party 
20 • make offer 

• withdraw offer 

• accept 

• reject 

• counteroffer 
25 • finished 

Moreover, for training purposes, additional 
information may also be used in the generation of suitable 
training records for any of the above agents, although 
such information is not directly input to the neural 
30 network. Such information may include: 

• desired price and features 
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• agent autonomy price range (range of prices within 
which agent is authorized to negotiate 
autonomously) 

• manager approval price range (range of prices 

5 within which agent requires approval from manager) 

• acceptable iterations allowed without approval 
It should be appreciated that other agents 

implementing alternate negotiation strategies and decision 
making processes, and with other degrees of autonomy 

10 between fully- autonomous and fully-dependent, may be used 
in the alternative. Moreover, it should be appreciated 
that other functionality may be required for the 
autonomous operation of any of the agents in agent pool 
42, and that such additional functionality may be 

15 implemented via procedural logic and/or neural networks. 
For example, functions such as initialization, 
maintenance, finding other agents or markets to interact 
with, etc. may also be utilized. However, as these 
functions relate more to the basic operation of an agent, 

20 which is in general known in the art, these functions will 
not be discussed in any greater detail herein. 

Referring again to Fig. 7, as discussed above, agent 
manager 32 may select from multiple program modules of 
varied degrees of domain knowledge to perform a desired 

25 computer task. Agent manager 32 may utilize an agent from 
pool 42 having multiple program modules with varied 
degrees of domain knowledge, and then configure the agent 
to execute a selected program module based upon the 
ob j ective criteria . 

30 For example, as shown in Fig. 11, agent manager 32 

may utilize an agent 300 that includes a main module 302 
which oversees the operations of the agent, and a 
communications module 304 that handles communications 
between the agent and its manager, and between the agent 

35 and other parties. Moreover, one or more additional 
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program modules 306-308 are interconnected to main module 
302 to provide alternate functionality for the agent. 
Program module 30 6 implements a semi-autonomous 
functionality which operates in the manner disclosed above 
5 for the semi-autonomous agent of Fig. 8. Program module 
307 implements a fully-dependent functionality which 
operates in the manner disclosed above for the fully- 
dependent agent of Fig. 9. Program module 308 implements 
a fully- autonomous functionality which operates in the 
10 manner disclosed above for the fully- autonomous agent of 
Fig. 10. 

Selection of which of modules 306-308 may be made by 
agent manager, in the manner disclosed above with 
reference to Fig. 7. In the alternative, selection of 

15 program modules based upon an objective criteria may be 
made internally within the agent, e.g., using 
predetermined selection criteria (e.g., to employ 
different negotiation strategies depending upon the time 
of day, etc.), or using an optional reinforcement learning 

20 module 309 which operates in a similar manner to module 34 
discussed above. It should be appreciated that when 
selection is performed within an agent, selections may be 
updated from time to time to adapt the behavior of the 
agent in view of changing circumstances. 

25 As also illustrated in Fig. 7, agent manager 32 may 

rely on one or more pools of program modules 42' to 
construct a task-specific agent depending upon an 
objective criteria. For example, as shown in Fig. 12, 
agent manager 32 may construct an agent 310 from a number 

30 of components. First, components which are generic to all 
permutations of an agent may be utilized, e.g., a main 
module 312 and communications module 314. 

Second, components may be selected from pluralities 
of alternate program modules. For example, in one 
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embodiment where the decision logic of an agent is 
implemented in a neural network, a pool 320 of decision 
logic neural network modules may be provided, with another 
pool 322 of price range filters used to optimize the 
5 operation of the neural network modules for different 

price ranges. The use of the latter pool 322 permits the 
modules in pool 32 0 to be price independent so that, when 
agent manager 32 is required to dispatch an agent to 
conduct transactions for a particular product in a 

10 particular price range, the agent manager merely selects 

an appropriate decision logic network module from pool 320 
(e.g., semi-autonomous program module 316) and price range 
filter from pool 322 (e.g., price range C filter 318). 
Other manners of customizing the modules in pool 320 may 

15 be envisioned, e.g., selecting from different requirements 
filters to optimize an agent for different products. 
Also, both procedural and neural network implemented 
decision logic modules may be included in pool 320 so 
that, depending upon the risk to a dispatched agent, 

20 either type of logic may be utilized. 

Third, instead of selecting among alternate program 
modules, agent manager 32 may add additive program modules 
to enhance the domain knowledge of the dispatched agent. 
For example, in one embodiment, a pool of optional modules 

25 324 may be provided with, among other modules, a time - 
monitor module 319 that permits the negotiation strategy 
of the agent to vary depending upon the time of day (e.g., 
for markets which tend to be more volatile at the end of 
the day, module 319 may implement a more conservative 

30 negotiation strategy than otherwise employed) . Other 

optional functionality may be implemented. For example, 
agent performance monitoring may optionally be included 
within agent 310 if desired. 
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Any number of programming languages and underlying 
platforms may be utilized to implement agents consistent 
with the invention. For example, Java and other object 
oriented programming languages are particularly well 
5 suited for use with agents implementing multiple program 
modules having varied degrees of domain knowledge, as 
separate classes implementing each of the alternate 
program modules may be interconnected with one another in 
a single package, 
10 Other modifications may be made to the illustrated 

embodiments without departing from the spirit and scope of 
the invention. Therefore, the invention lies solely in 
the claims hereinafter appended. 
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WHAT IS CLAIMED IS: 

1 1. A method of handling a computer task using an 

2 intelligent agent, the method comprising the steps of: 

3 (a) based upon an objective criteria, selecting 

4 at least one selected program module from a plurality 

5 of program modules having varied degrees of domain 

6 knowledge, wherein the plurality of program modules 

V are configured to handle a common computer task; and 

8 (b) configuring an intelligent agent to execute 

9 the at least one selected program module to handle 
1° the computer task. 

1 2. The method of claim 1, wherein the intelligent 

2 agent includes only the selected program module from the 

3 plurality of program modules, and wherein the configuring 

4 step includes the step of constructing the intelligent 

5 agent using the selected program module. 

1 3. The method of claim 1, wherein the intelligent 

2 agent includes each of the plurality of program modules, 

3 and wherein the configuring step includes the step of 

4 configuring the intelligent agent to execute only the 

5 selected program module to handle the computer task. 

1 4. The method of claim 1, wherein the selecting step 

2 is performed by the intelligent agent. 

1 5. The method of claim 1, wherein the selecting step 

2 is performed by an agent manager. 

1 6. The method of claim 1, wherein the plurality of 

2 program modules are additive program modules, and wherein 

3 the selecting step includes the step of selecting a subset 
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4 of the plurality of program modules to handle the computer 

5 task. 

1 7. The method of claim 1, wherein the plurality of 

2 program modules are alternative program modules, and 

3 wherein the selecting step includes the step of selecting 

4 only one of the plurality of program modules to handle the 

5 computer task. 

1 8. The method of claim 1, wherein the selecting step 

2 includes the step of adaptively selecting the selected 

3 program module using a reinforcement learning algorithm. 

1 9. The method of claim 8, further comprising the 

2 steps of: 

3 (a) obtaining performance information relating 

4 to the performance of the selected program module in 

5 handling the computer task; and 

6 (b) supplying the performance information to 

7 the reinforcement learning algorithm. 

1 10. The method of claim 8, wherein" the reinforcement 

2 learning algorithm is implemented in an adaptive heuristic 

3 critic neural network. 

1 11. The method of claim 1, wherein the 'selecting 

2 step includes the steps of: 

3 (a) matching each of the plurality of program 

4 modules with a value of the objective criteria; 

5 (b) determining a selected value of the 

6 objective criteria; and 

7 (c) selecting as the selected program module a 

8 program module matching the selected value of the 

9 obj ective criteria . 
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1 12. The method of claim 11, wherein the selecting 

2 step further includes the step of retrieving information 

3 for a selected computer task, wherein the determining step 

4 determines the selected value of the objective criteria 

5 using the retrieved information. 

1 13. The method of claim 1, wherein the intelligent 

2 agent is configured to conduct negotiations in an 

3 electronic commerce application, and wherein the domain 

4 knowledge for each of the plurality of program modules is 

5 related to the autonomy delegated thereto. 

1 14. The method of claim 13, wherein the plurality of 

2 program modules includes a semi-autonomous program module, 

3 a fully-autonomous program module, and a fully-dependent 

4 program module. 

1 15. The method of claim 13, wherein the objective 

2 criteria includes a risk that a dispatched agent is 

3 subjected to in negotiations. 

1 16. An apparatus for handling a computer task, 

2 comprising: 

3 an intelligent agent including at least one of a 

4 plurality of program modules having varied degrees of 

5 domain knowledge, wherein the plurality of program 

6 modules are configured to handle a common computer 

7 task, and wherein, based upon an objective criteria, 

8 at least one selected program module from the 

9 plurality of program modules is selected to handle 
10 the computer task. 



Page 41 

IBM R0 996-0 052 

M&G 10790. 100US01 
Patent Application 



42 

1 17. The apparatus of claim 16, further comprising an 

2 evaluation module configured to select the selected 

3 program module based upon the objective criteria, 

1 18. The apparatus of claim 17, further comprising a 

2 reinforcement learning module, coupled to the evaluation 

3 module and configured to adaptively select program modules 

4 based upon the performance of the plurality of program 

5 modules in handling the computer task. 

1 19. The apparatus of claim 18, wherein the 

2 reinforcement learning module comprises an adaptive 

3 heuristic critic neural network. 

1 20. The apparatus of claim 17, wherein the 

2 evaluation module is configured to retrieve information 

3 for a selected computer task, determine a selected value 

4 for the objective criteria for the selected computer task, 

5 and select as the selected program module one of the 

6 plurality of program modules which is matched with the 

7 selected value of the objective criteria. 

1 21. The apparatus of claim 17, wherein the 

2 evaluation module is implemented in an agent manager. 

1 22. The apparatus of claim 17, wherein the 

2 evaluation module is implemented ii> the intelligent agent. 

1 23. The apparatus of claim 17, wherein the 

2 intelligent agent includes only the selected program 

3 module from the plurality of program modules, and wherein 

4 the evaluation module is configured to construct the 

5 intelligent agent using the selected program module. 
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6 24. The apparatus of claim 17, wherein the 

7 intelligent agent includes each of the plurality of 

8 program modules, and wherein the evaluation module is 

9 configured to execute only the selected program module to 
10 handle the computer task. 

1 25. The apparatus of claim 17, wherein the plurality 

2 of program modules are additive program modules, and 

3 wherein the evaluation module is configured to select a 

4 subset of the plurality of program modules to handle the 

5 computer task. 

1 26. The apparatus of claim 17, wherein the plurality 

2 of program modules are alternative program .modules, and 

3 wherein the evaluation module is configured to select only 

4 one of the plurality of program modules to handle the 

5 computer task. 

1 27. The apparatus of claim 16, wherein the 

2 intelligent agent is configured to conduct negotiations in 

3 an electronic commerce application, and wherein the domain 

4 knowledge for each of the plurality of program modules is 

5 related to the autonomy delegated thereto. 

1 28. The apparatus of claim 27, wherein the plurality 

2 of program modules includes a semi-autonomous program 

3 module, a fully- autonomous program module, and a fully- 

4 dependent program module. 

1 29. The apparatus of claim 27, wherein the objective 

2 criteria includes a risk that a dispatched agent is 

3 subjected to in negotiations. 
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1 30. A program product comprising: 

2 (a) a program configured to perform a computer 

3 task using an intelligent agent, the program 

4 comprising an intelligent agent including at least 

5 one of a plurality of program modules having varied 

6 degrees of domain knowledge, wherein the plurality of 

7 program modules are configured to handle a common 

8 computer task, and wherein, based upon an objective 

9 criteria, at least one selected program module from 

10 the plurality of program modules is selected to 

11 handle the computer task; and 

12 (b) a signal bearing media bearing the program. 

1 31. The program product of claim 30, wherein the 

2 signal bearing media is transmission type media. 

1 32. The program product of claim 30, wherein the 

2 signal bearing media is recordable media. 

1 33. A method of handling a computer task on a remote 

2 computer system using an intelligent agent, the method 

3 comprising the steps of: 

4 (a) determining a risk for the remote computer 

5 system; 

6 (b) based upon the risk for the remote computer 

7 system, selecting at least one selected program 

8 module from a plurality of program modules having 

9 varied degrees of domain knowledge, wherein the 

10 plurality of program modules are configured to handle 

11 a common computer task in the remote computer system; 

12 and 

13 (c) configuring an intelligent agent to execute 

14 the at least one selected program module to handle 

15 the computer task. 
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1 34. The method of claim 33, further comprising the 

2 step of matching each of the plurality of program modules 

3 with a risk level. 

1 35. The method of claim 34, wherein the matching 

2 step includes the step of adaptively matching each program 

3 module based upon the actual performance of the plurality 

4 of program modules. 
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APPARATUS AND METHOD FOR OPTIMIZING THE PERFORMANCE OF 
COMPUTER TASKS USING INTELLIGENT AGENT WITH MULTIPLE 
PROGRAM MODULES HAVING VARIED DEGREES OF DOMAIN KNOWLEDGE 

5 

Abstract of the Disclosure 

The performance of a given task is optimized by 
utilizing an intelligent agent having a plurality of 
program modules suited to perform the computer task but 

10 having varied degrees of domain knowledge. Based upon an 
objective criteria that may be determined for a given 
situation, one or more of the program modules in the 
intelligent agent may be selected to perform the task, 
thereby optimizing the performance of the computer task 

15 for a wide variety of situations. 
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claim that is cancelled or withdrawn from consideration need not be submitted if the information is not material to the patentability of any 
claim remaining under consideration in the application. There is no duty to submit information which is not material to ; the -Vf**** of 
anv existing claim The duty to disclose all information known to be material to patentability is deemed to be satisfied if all information 
known to be material to patentability of any claim issued in a patent was cited by the Office or submitted to the Offic f ^ ™ r 
described bv 1 97(b)-(d) and 1.98. However, no patent will be granted on an application m connection with which fraud on the Office 
was practiced or attempted or the duty of disclosure was violated through bad faith or intentional misconduct. The Office encourages 
applicants to carefully examine: 

(1) prior art cited in search reports of a foreign patent office in a counterpart application, and 

(2) the closest information over which individuals associated with the filing or prosecution of a patent application believe any 
pending claim patentably defines, to make sure that any material information contained therein is disclosed to the Office. 

(b) Under this section, information is material to patentability when it is not cumulative to information already of record or being 
made of record in the application, and 

1 (1) It establishes, by itself or in combination with other information, a prima facie case of unpatentability of a claim; 

l = : (2) It refutes, or is inconsistent with, a position the applicant takes in: 
S (i) Opposing an argument of unpatentability relied on by the Office, or 
^ (ii) Asserting an argument of patentability. 

©Drima facie case of unpatentability is established when the information compels a conclusion that a claim is unpatentable under the 
b^^^TeSSTb^if-pioof standard, giving each term in the claim its broadest reasonable construction consisten with the 
PS^S^c^ration is given to evidence which may be submitted in an attempt to establish a contrary conclusion of 

patentability. 

8 (c) Individuals associated with the filing or prosecution of a patent application wimm the meaning of this section are: 

(1) Each inventor named in the application: 

(2) Each attorney or agent who prepares or prosecutes the application; and 

(3) Every other person who is substantively involved in the preparation or prosecution of | the application and who is associated with 
the inventor, with the assignee or with anyone to whom there is an obligation to assign the application. 

(d) Individuals other than the attorney, agent or inventor may comply with this section by disclosing information to the attorney, 

agent, or inventor. 
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